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METHOD AND SYSTEM FOR DISAMBIGUATION AND PREDICTIVE RESOLUTION 

RELATED APPLICATIONS 
[0001] This application is related to United States Patent 



System for a Data Entry Interface" by Van der Hoeven filed of 
an even date (Attorney Docket No. FORTllOO) . All applications 
cited within this paragraph are fully incorporated herein by 
reference . 
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TECHNICAL FIELD OF THE INVENTION 

002] The invention relates in general to data entry, and more 

particularly, to interfaces and logic for the efficient entry 
of data into devices . 
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BACKGROUND OF THE INVENTION 
[0003] Since the dawn of the computer age one of the most steady and 
continuous trends has been toward miniaturization. This trend 
manifests itself in every facet of the industry. Cell phones, 
personal digital assistants (PDAs), portable computers etc. 
have all decreased in size at astounding rates. Conversely, 
however, as these devices have shrunk they have also become 
more powerful. This dichotomy is somewhat problematic. The 
need to communicate information to, and glean information 
from, a user increases commensurately with the power of a 
device, while the real estate available for these display and 
data entry interfaces naturally shrinks along with the device. 

[0004] With each of the two subsystems of data entry and display, 
there are inherent physical limitations on possible size 
reduction before each subsystem becomes either ineffective or 
useless. The physical limitations placed on communications to 
a user are much greater than those placed on an interface for 
data entry. These limitations may include hardware 
limitations, physiological limitations, and the fact that, in 
most cases, data must be communicated to a user visually. 
Therefore, the majority of a device's footprint reserved for 
communications and data entry may be taken up by a visual 
display. This visual display leaves little room for an 
interface for data entry. 

[0005] Solutions to address this problem have been devised by 

incorporating handwriting software in a device. To utilize 
this solution, a user may directly enter text by writing on a 
touch sensitive display. This handwritten data is then 
converted by handwriting recognition software into digital 
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data. While this solution allows one display to both 
communicate information to, and receive data from, a user, the 
accuracy and speed of handwriting recognition software is 
notoriously bad. Additionally, printing or writing with a 
stylus is generally slower than typing, or key based input, 
and requires the use of two hands . 

[0006] It is therefore advantageous to use an interface for entering 
data which is small and may be operated with one hand. Prior 
attempts at such a keyboard have resulted in the development 
of a keyboard that has a reduced number of keys. FIGURE 1 
shows a prior art reduced keypad 100 for a device. The keypad 
100 is a standard telephone keypad having ten number keys 110- 
128 an asterisk (*) key 130, and a pound (#) key 140. For 
English, and many other alphabet languages, the numeric keypad 
of the telephone is overlaid with an alphabet keypad where 
three letters are associated with each number key 110-128. 
For example, the two (2) key 114 is associated with the 
letters a-b-c. With some of these reduced key devices, a user 
may employ multiple key presses to enter words or names. 

[0007] However, this solution is problematic as well, as Chinese and 
other character based languages such as Japanese kanj i do not 
have a manageable number of letters which can be overlaid onto 
a numeric keypad. An additional problem is that each key 
press may contain multiple characters, creating ambiguity in a 
sequence of keystrokes . 

[0008] A solution to the problem of ambiguity is to have the user 
enter two or more keystrokes to specify each letter, the 
keystrokes may be entered simultaneously (chording) or in 
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sequence. However, these approaches require many keystrokes 
and are consequently inefficient. Ideally, a data entry 
method would require one keystroke per letter. 

[0009] In the past, word level disambiguation has been used to 
disambiguate entire words by comparing the sequence of 
received keystrokes with possible matches in a dictionary. 
But, because of decoding limitations, word level 
disambiguation does not give error free decoding of 
unconstrained data with an efficiency of one keystroke per 
character. 

[0010] Thus, there is a need for apparatuses, methods and systems for 
interfaces which enable the efficient one-handed entry of a 
variety of data, and the ability to reduce the number of 
strokes required for data entry by using logic for the 
resolution of character ambiguity and the predictive 
completion of input. 
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SUMMARY OF THE INVENTION 
[0011] Apparatuses, methods and systems for the efficient input of 



text on a device with a restricted set of input channels are 
disclosed. These apparatuses, systems and methods may employ 
a user interface which allows efficient one handed operation 
by using a key layout that may be manipulated by the thumb of 
a single hand. This interface may be tailored to the specific 
device on which it is utilized, and in some instances 
multiple characters may be associated with each key or portion 
of a key. To assist in the efficient entry of data, these 
apparatuses., systems and methods may disambiguate user input 
in the case where data entered by the user may have several 
interpretations, provide an efficient way to specify the input 
exactly, or predict user input and submit these predictions to 
the user. These predictive systems and methods may also adapt 
to the proclivities of a certain user, and incorporate 
abbreviations, translations, and synonyms for frequently used 
inputs . 



[0012] In one embodiment, a candidate may be formulated based on a 



prefix and an input, a prediction may be formulated based on 
the set of symbols associated with the candidate and a 

proposition formulated based on the candidate or the 
prediction. 



[0013] In similar embodiments, the prefix is empty, entered by a 
user, or a previous proposition. 

[0014] In other embodiments, the score of the candidate is based on a 
score of the prefix concatenated with the input. 

Gray Cary\AU\4 124572 . 1 
2504036-991110 



ATTORNEY DOCKET NO. 
FORTH 10 



- 8 - 



PATENT APPLICATION 
Customer ID: 25094 



[0015] 



[0016] 



[0017] 



[0018] 



[0019] 



[0020] 



[0021] 



[0022] 



[0023] 



In still other embodiments, the score of the candidate is 
based on a score of the input combined with the score of the 
prefix. 

In yet other embodiments, the set of candidates is ranked 
based upon the score of each of the candidates . 

In another embodiment, a candidate is formulated based on a 
language model . 

In various other embodiments, the score of the input is based 
on a string which is the longest string in the language model 
which is a extension of the prefix. 

In still another embodiment, a prediction is formulated based 

on a usage model . 

In some embodiments, the prediction is a sequence 
corresponding to a path through a tree starting from a node 
based on the candidate and ending with a terminal node. 

In other embodiments, formulating a prediction is based on 
scoring each of the predictions based on frequency of 
appearance and length of the path through the tree. 

In one embodiment, a set of synonyms is determined for the 
prediction. 

In an additional embodiment, a set of synonyms, a set of 
predictions and a set of candidates are stored, the set of 
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synonyms, the set of candidates, and the set of predictions 
are ranked. 

[0024] These, and other, aspects of the invention will be better 

appreciated and understood when considered in conjunction with 
the following description and the accompanying drawings. The 
following description, while indicating various embodiments of 
the invention and numerous specific details thereof, is given 
by way of illustration and not of limitation. Many 
substitutions, modifications, additions or rearrangements may 
be made within the scope of the invention, and the invention 
includes all such substitutions, modifications, additions or 
rearrangements . 
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[0025] 



i 

[0026] 

I [0027] 

j 

I 
I 

I 

' [0028] 



[0029] 



[0030] 



[0031] 



BRIEF DESCRIPTION OF THE DRAWINGS 

The drawings accompanying and forming part of this 
specification are included to depict certain aspects of the 
invention. A clearer impression of the invention, and of the 
components and operation of systems provided with the 
invention, will become more readily apparent by referring to 
the exemplary, and therefore nonlimiting, embodiments 
illustrated in the drawings, wherein identical reference 
numerals designate the same components. Note that the features 
illustrated in the drawings are not necessarily drawn to 
scale - 

FIGURE 1 includes an illustration of a prior art reduced 
Iceypad . 

FIGURE 2 includes a representation of one possible 
architecture for implementing the disclosed systems and 
methods . 

FIGURE 3 includes a representation of one embodiment of a 
layout for employing zones with a principle sensor. 

FIGURE 4 includes another representation of an embodiment of a 
layout for employing zones with a principle sensor. 

FIGURE 5 represents one embodiment of a layout for employing 
zones for data entry on a device. 

FIGURE 6 represents in more detail some of the zones depicted 
in FIGURE 5. 
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[0032] 



[0033] 



[0034] 



[0035] 



FIGURE 7 includes one metliodology for the processing of inputs 
by tlie core logic . 

FIGURE 8 includes one metliodology for formulating candidates 
wliicli may be used by the core logic during the processing of 
inputs . 

FIGURE 9 includes one methodology of formulating predictions 
which may be used by the core logic. 

FIGURE 10 includes a representation of an embodiment of a data 
structure for storing a language model . 
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DESCRIPTION OF PREFERRED EMBODIMENTS 
[0036] The invention and the various features and advantageous 

details thereof are explained more fully with reference to the 
nonlimiting embodiments that are illustrated in the 
accompanying drawings and detailed in the following 
description. Descriptions of well known starting materials, 
processing techniques, components and equipment are omitted so 
as not to unnecessarily obscure the invention in detail. 
Skilled artisans should understand, however, that the detailed 
description and the specific examples, while disclosing 
preferred embodiments of the invention, are given by way of 
illustration only and not by way of limitation. Various 
substitutions, modifications, additions or rearrangements 
within the scope of the underlying inventive concept (s) will 
become apparent to those skilled in the art after reading this 
disclosure . 



[0037] Reference is now made in detail to the exemplary embodiments 
of the invention, examples of which are illustrated in the 
accompanying drawings. Wherever possible, the same reference 
numbers will be used throughout the drawings to refer to the 
same or like parts (elements) . 

[0038] A few terms are defined or clarified to aid in an 
understanding of the terms as used throughout the 
specification. 

[0039] The term ''character" is intended to mean any symbol 

whatsoever, a character may be a letter, a diacritical mark, a 
portion of a kanji, any symbol with a connotation in the art 
of writing and printing, etc. 
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[0040] 

[0041] 
[0042] 

[0043] 

[0044] 

[0045] 

[0046] 



Tlie term "Icey" is intended to mean a zone of an interface, 
responsive to a user action, including toucli. Semantic meaning 
may be associated witli a Icey or portions of a Jcey, tliis 
semantic meaning may consist of one or more cliaracters or 
control information , 

The term "Iceyboard" is intended to mean a grouping of Iceys. 

Tlie term ''prediction" is intended to mean the results of an 
attempt to predict tlie completion of a sequence of symbols. 
Tliis prediction may be based on any type of system, including 
probabilistic, matliematical , linguistic, or any other. 

The term ''synonym," as used with respect to a use model, is 
intended to mean a possible alternative to a prediction. 

These synonyms may be associated with the prediction, and may 
be stored in a data structure corresponding to a prediction, 
or the terminal node corresponding to a prediction. 

The term "proposition" is intended to mean a sequence of one 
or more characters, or a set of these sequences. 

The term "prefix" is intended to mean a sequence of one or 
more characters which may or may not be extended with a user 
input, the results of disambiguation or prediction etc. 

The term "candidate" is intended to mean a sequence of one or 
more characters and an associated score. This score may be 
based on any model including probabilistic, mathematical, 
linguistic, or any other. 
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[0047] 



[0048] 



[0049] 



[0050] 



[0051] 



[0052] 



[0053] 



The term "active zone" is intended to mean a zone of an 
interface wtiicli is designated to be part of an interface used 
to receive input from a user. 

Tlie term ''zone" is intended to mean an area of an interface or 
witliin the purview of a sensor. 

The term "ambiguous" is intended to mean that something may- 
have several possible meanings. 

The term ''completion" is a set of symbols which may form a 
desired final or intermediate sequence of symbols when 
concatenated with an initial set of symbols (usually referred 
to as prefix) . In many circumstances a completion may have 
been calculated to have a high probability of forming the 
desired final or intermediate sequence. 

The term "send a message" is intended to mean transmission or 
exchange or utilization of information. It is important to 
note that no assumption may be made about how a message is 
sent, or even if the sending of a message is necessary in any 
particular embodiment. 

The term "sequence" is intended to mean a finite number of 
elements with a notion of order. A set may be empty or 
contain only one element. This is a definition commonly 
accepted in mathematics. We note it as (ei,...,en) 

The term "set" is intended to mean a finite number of elements 
with no notion of order. A set may be empty or contain one 
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element. This is a definition commonly accepted in 
mathematics. We note it as {ei,...,en}- 

[0054] The term "type" is intended to mean a range of values or 

classes. Typing a variable or a value is to enumerate it, or 
specify the values or classes it may be. Typing an element E 
with a type T is written as followed: ''E:T" 



[0055] The term "structure" is intended to mean an ordered set of 

elements (ei,...,en) . Each element ei may be typed with the type 
ti. The type of the structure is then the sequence (tl,...,tn) . 

[0056] The term "field" is intended to mean an element of a 

structure. Fields can be named. P(e:t) implies the structure 
P has a field e of type t. If p is of type P (as above), p.e 
represents the field of the element p. 



[0057] Attention is now directed to apparatuses, methods and systems 
for the efficient input of text on a device with a restricted 
set of input channels. These apparatuses, systems and methods 
may employ a user interface which allows efficient one handed 
operation by using a key layout that may be manipulated by the 
thumb of a single hand. This interface may be tailored to the 
specific device on which it is utilized, and in some 
instances, multiple characters may be associated with each key 
or portion of a key. To assist in the efficient entry of 
data, these apparatuses, systems and methods may disambiguate 
the user input from these keys in the case where data entered 
by the user may have several interpretations, or may provide 
an efficient way for a user to specify the input exactly. 
These systems and methods may also predict user input and 
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submit these predictions to the user so they may be selected 
by the user, instead of requiring the user to enter the entire 
text manually- These predictive systems and methods may also 
adapt to the proclivities of a certain user, and incorporate 
user defined abbreviations and synonyms for frequently used 
inputs . 

[0058] In an illustrative embodiment of the invention, the computer- 
executable instructions may be lines of assembly code or 
compiled C'^"*', Java, or other language code. Other 
architectures may be used. Additionally, a computer program 
or its software components with such code may be embodied in 
more than one computer system readable medium in more than one 
computer. 

[0059] Turning now to FIGURE 2, one embodiment of an architecture 200 
for implementing and organizing the apparatuses, methods and 
systems disclosed is presented. After reading this 
specification, sJcilled artisans will appreciate that many 
other architectures may be utilized with embodiments of the 
present invention. In one embodiment, these architectures 
include principal sensor 210, additional sensors 212, 
keystrolce module 220, semantization module 240, core logic 
250, language model 260 and usage model 270. KeystroJce module 
220 may receive data from principle sensor 210. Keystrolce 
module 22 0 processes this data and sends the results to 
semantization module 240. Semantization module 240 talces the 
results from Iceystrolce module 220 and from any additional 
sensors 212. Semantization module 240 may then process this 
data to formulate data for use by core logic 250 using use 
context 290, text element 280, menu element 282 and output 
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devices 284. Core logic 250 may resolve ambiguity or 
predictively complete a user's input using input from 
semantization module 240, language model 260 or usage model 
270. Core logic 250 may present its results to semantization 
module 240. Semantization module may then output results 
through a combination of use context 290, text element 280, 
menu element 282 and output devices 284. 

[0060] In general, this architecture is designed to enhance a user's 
ability to input data on a device by allowing a wide variety 
of configurations for an input device, including a 
configuration designed for use by the thumb of one hand. 
Embodiments may also disambiguate input, allow a user an 
efficient way to unambiguously specify the exact input, and 
submit predictive completions to a user in order to avoid 
additional manual input. A person of ordinary skill in the 
art will realize, after reading this disclosure, that this 
architecture is presented for descriptive purposes only; and 
that all the functionality described may be embodied in a wide 
variety of ways, including with more or fewer modules, with 
messages or without messages being passed, in software, in 
hardware, etc . 

[0061] In specific embodiments, a user's actions are captured by 

sensors 210, 212 and reported to the semantization module 240. 
In some embodiments, information from principle sensor 210 may 
be processed by keystroke module 220 before being reported to 
semantization module 240. Semantization module 240 may be the 
only interface to a user, and may be responsible for 
translating and formatting all incoming and outgoing messages. 
The translation of semantization module 240 may be context 
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dependent or may provide parameterization for integration into 
a system. Core logic 250 may contain rules for managing the 
interaction between semantization module 240 and a user, 
including prediction of user input and resolution of 
ambiguity, 

[0062] Core logic 250 may use data structures, including language 

model 260 for disambiguating user input, and user model 270 to 
predict user input. Text element 2 80, menu element 282, and 
other optional output devices 284 may be used by semantization 
module 240 to show the results of its logic or the state of 
the system. Text element 280 may be a data entry interface 
whereby a user may insert or input a sequence of symbols, or 
extend an existing sequence of symbols, and through which a 
sequence of symbols may be displayed. 

[0063] In one particular embodiment, text element 280 is the device 
by where a user may insert a sequence of symbols or extend an 
existing sequence. Text element 280 can send a message 
(EditText) to the system or semantization module 240, 
containing information on a possible prefix or other user 
input text. Text element 280 then receives a SubstituteText 
message from semantization module 240 containing a sequence of 
symbols (S) in response to the EditText message. Text element 
280 may also receive a EndEdit message, signaling the 
completion of the EditText message when a user has finished 
entering a sequence of symbols. When text element 280 
receives this SubstituteText message, text element 280 may 
respond accordingly. If it is the first SubstituteText 
received since the last EditText was sent and the last 
EditText had a prefix, the SubstituteText argument is 
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substituted for the sequence of symbols relative to the prefix 
in text element 280, which is then displayed to the user. If 
this is the first SubstituteText received since the last 
EditText was sent, and the last EditText had no prefix, text 
element 280 may just insert the SubstituteText argument in the 
sequence displayed to the user. For two consecutive 
SubstituteText messages without an EndEdit between them, it 
replaces the text of the first SubstituteText messages by the 
argument of the second and displays the information to the 
user. 

[0064] Menu element 282 may present user with a series of choices or 
options, depending on an output or state of the system. Menu 
element 282 can be combined with text element 280 in certain 
embodiments. Menu element 282 may display a list of choices 
or a tree of choices, for selection by a user. Menu element 
282 may receive SetMenu messages from semantization module 240 
containing an argument. This argument may be a flat list of 
choices when the menu is a list, or an argument referring to 
'the position of choices in a tree or a nested list of choices, 
in the case of a tree menu. The choices may be sequences of 
symbols or constants. When the user selects a choice from the 
list presented, a Selectltem message may be sent to 
semantization module 240 with the value of the choice as an 
argument . 

[0065] Use context 290 may control properties and layouts of sensors 
210, 212 and notify the user of certain events. In one 
embodiment, use context 290 may use a Text Input Query to 
prepare text element 280 or menu element 282 for interaction 
with a user. Text element 280 can respond to by emitting an 
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EditText message with a QueryFlag to semantization module 240, 
indicating that text element 280 is ready for editing. 

[0066] Use context 290 may also be aware when text editing is 

performed in text element 280 through the interception of 
StartEdit messages when editing starts, or EditStop message 
when editing stops. Use context 290 may also control various 
properties of principle sensor 210, and notify a user of 
certain events . 

[0067] These events are triggered through a UserNotify message which 
may contain information about the type of notification. The 
system may interpret the action to undertalce for each type of 
notification. This interpretation may be done by 
semantization module 240. 

[0068] Principle sensor 210 may monitor a set of variables in an 

environment and send the results through discrete messaging to 
keystroJce module 220. Additionally, principle sensor 210 may 
be capable of detecting actions of a user, which may consist 
of a user designating a particular area within the range of 
principle sensor 210, a sequence of movements made by a user, 
and the user's cessation of movement. These actions usually 
involve direct physical contact between a user and the sensor, 
indirect contact through the use of a tool, such as a stylus, 
pen, etc. or methods not involving physical contact such as 
eye movements, laser pointer, etc. Those of ordinary slcill in 
the art will realize that a wide variety of options exist for 
implementing principle sensor 210. Examples of these options 
include eye tracking systems, touch screens, joysticks, 
graphical user interfaces, traditional keyboards in all their 
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electronic or mechanical forms and the like. Similarly, these 
sensing device (s) may be implemented in hardware, software, or 
a combination of the two. It will also be apparent, after 
reading this disclosure, that a wide variety of layouts may be 
utilized with principle sensor 210 to provide an interface for 
data entry. 

[0069] Before delving into the interaction of the components of 

architecture 2 00, one example of a layout for use with a type 
of principle sensor 210 which may detect contact is presented 
in FIGURE 3. Within principle sensor 210, a set of non- 
overiapping zones 300-390 may be defined. These zones 300-390 
may be key zones 300, 310-390 designed to aid interaction with 
a user. These key zones 300, 310-390 may be arranged along 
one or more rows 312, 314, each row containing one or more of 
key zones 300, 310-390 and the rows may in turn be arranged in 
concentric curves. The zone which lies outside of the rows 
312, 314 of key zones 300, 310-390 is nokey zone 302. Each 
key zone 300 310-390 is associated with a unique identifier, 
the unique identifier representing the ordering of the key 
zones 300, 310-390 by means of a key zone's 300, 310-390 
position among the rows and between the rows. 

[0070] For example, rows 312, 314 may be numbered from smallest to 
larges starting with the innermost row. In the example 
depicted in FIGURE 3, row 314 would be row number 1 while row 
312 would be row number 2. Key zones 300, 310-390 may also be 
ordered within their respective rows. In FIGURE 3, numbers may 
be assigned to key zones 300, 310-390 from left to right, 
consequently zones 300, 350 may be numbered 1, zones 320, 360 
number 2 and so on. Using these row and zone orderings, a 
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unique identifier may be assembled for each zone implemented 
utilizing principle sensor 210. In the current example, key- 
zone 300 would be (2,1), key zone 350 may be identified as 
(1,1), key zone 360 as (1,2) etc. As can be easily seen by 
those of ordinary skill, many schemes may be utilized to 
uniquely identify the set of zones 300-390 within principle 
sensor 210, and these zones 300-390 may be defined in a 
variety of topologies which may be suited to the size of a 
particular sensor, device, user or any other criteria. 

[0071] Principle sensor 210 may monitor the environment and determine 
a pointed zone or contact zone where contact is being made by 
a user. This pointed zone may consist of only the point or 
location where contact is made or may consist of a defined 
area surrounding the location of actual contact. When contact 
is made, principle sensor 210 may determine if zones 300-390 
overlap the pointed zone. For each zone 300-390 which overlaps 
the pointed zone, principle sensor 210 sends a message to 
keystroke logic 220. Each message may be a message which 
identifies the start of contact (StartContact) and the 
respective unique identifier for the overlapping zone 300-390. 
When the pointed zone is moved, and begins to overlap any 
other zone 300-390, a message is sent to keystroke logic 220 
indicating a zone 300-390 has been entered along with the 
unique identifier of that zone 300-390. When the contact 
ceases, principle sensor 210 senses that contact has been 
stopped, and indicates this to keystroke module 220 
(StopContact) . Therefore, principle sensor can sense a start 
of contact, movement of this contact, and release of the 
contact. Messages of this type may be exchanged between 
principle sensor 210 and keystroke module 220 using a message 
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of the type described above. It is to be noted that though all 
types of contact may be determined by principle sensor 210, 
messages regarding all of them may not be passed to keystroke 
module 22,0, or used in evaluating the user's contact. 

[0072] Zones 300-390 may be designated as active or inactive 
depending on the layout of zones 300-390, the user's 
specification, the semantic meaning associated with zones 300- 
390 etc. When a zone 300-390 is designated as inactive, 
principle sensor 210 may not compare the area of principle 
sensor corresponding with the inactive zone with the pointed 
zone, and therefore no messages containing the inactive zone's 
unique identifier will be sent to keystroke logic 220. 
Similarly, a comparison may be made but no messages containing 
the inactive zone's unique identifier will be sent to 
keystroke logic 220. 

[0073] Keystroke logic 220 may serve to differentiate series of 
start/stop contact messages from principle sensor 210. 
Keystroke logic 22 0 may take the messages generated from 
principle sensor 210 and interpret these messages to determine 
the row number of a key zone 300, 310-390 in which contact is 
being made, and the number of the key zone 300, 310-390 in the 
row. Additionally, a key zone 300, 310-390 may be triggered 
in several modes depending on the movement of a user's contact 
or the user's release point, and keystroke logic may also 
determine the mode in which a key zone 300, 310-390 is 
triggered. These modes may be denoted by KeyCenter, KeyLower, 
KeyUpper, RowLower and RowUpper. These messages may 
correspond to the release point of a user, in relation to 
where contact by a user originally began. 
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[0074] In one embodiment, messages from principle sensor 210 may be 
processed according to the following algorithm. 

If the StartContact location is nokey, the keystroke module 
waits for the next StartContact message without any other 
activity. 

There are two variables: Row and Key. Each can take the 

values: Upper, Lower and Same 

For each StartContact (kri,ni) (where rl is the row to which 
the key belongs, and nl is the key number in the row) : 
Set the Row to Same and Key to Same . 
Then for each EnterZone (kr2/n2) : 
If r2>rl set Row to Upper 
If r2<rl set Row to Lower 
If r2=rl: 

Set Row to Same and 

If n2>nl set Key to Upper 
If n2<nl set Key to Lower 
If n2=nl set Key to Same 
For StopContact : 

If Row is equal to same: 

If Key is equal to Same emit Input (rl, nl, 
KeyCenter) 

If Key is equal to Lower emit Input (rl, nl, 
KeyLower) 

If Key is equal to Upper emit Input (rl, nl, 
KeyUpper) 

If Row is equal to lower emit Input (rl, nl, 
RowLower) 

If Row is equal to upper emit Input (rl, nl, 
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RowUpper) 

[0075] Turning now to FIGURE 4, another layout for use with 

embodiments of principle sensor 210 is depicted. While the 
layout described with respect to FIGURE 3 is perfectly 
adequate, when many zones 300-390 are present it may be 
difficult to determine the location of a user contact when the 
contact is close to the border of two zones. To remedy this 
difficulty, layout 400 on principle sensor 210 may include 
interkey zones 400-470 to implement a latching mechanism. 
Interkey zones 400-470 may be interleaved with key zones 300, 
310-390 such that no two key zones 300, 310-390 are 
contiguous. These interkey zones 400-470 may also have unique 
identifiers, and principle sensor 210 may sense when contact 
takes place in an interkey zone 410-470 and send a 
corresponding message to keystroke logic 220. When interkey 
zones 400-470 are present the messages from principle sensor 
210 may be processed according to the following algorithm. 

[0076] The row number x is now constituted of zones with the labels: 
{kx,i / ix,i # kx,2 / ix,2 ix,y-i , kx,y) , where y is the number 

of keys of row number x. If the StartContact location is 
nokey or an interkey zone, the keystroke module waits for the 
next StartContact message without any other activity. 
For each StartContact (kri,ni) : 
Set Row to Same and Key to Same 
For each Enter Zone {ir2,n2) 
If r2=rl then 

If n2<nl-l set Row to Same and Key to Lower 
If nl>nl set Row to Same and Key to Upper 
If r2<r set Row to Lower 
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If r2>r set Row to Upper 
For each EnterZone (kr2,n2) 



If r2=rl then 



If n2<nl set Row to same 



If n2=nl set Row to same 



and Key to Same 
and Key to Lower 



If n2>nl set Row to same 



and Key to Upper 



If r2<rl set Row to Lower 



If r2>rl set Row to Upper 
For the first StopContact 

If Row is equal to same 



If Key is equal to Same emit Input 

(rl , nl , KeyCenter) 
If Key is equal to Lower emit 

Input (rl , nl , LeyLower) 
If Key is equal to Upper emit 
Input (rl , nl , KeyUpper) 
If Row is equal to Lower emit Input (rl , nl , RowLower) 
If Row is equal to Upper emit Input (rl , nl , RowUpper) 



[0077] It will be readily apparent to those of ordinary skill in the 



art that interkey zones may be employed to implement a 
latching mechanism between any two zones or sets of zones. 

For example, an interkey zone may be employed to implement a 
latching mechanism between rows 312, 314. 



[0078] In FIGURE 5 a detailed layout of a user interface 500 for use 
with embodiments of the present invention is presented. User 
interface 500 may contain principle sensor 210 and text window 
510 for displaying information to the user. Principle sensor 
210 may contain a set of zones 520-542. Each set of zones may 
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present a set of symbols to the user. One set of zones 520- 
528 may correspond to characters of the English language, 
while other zones 530-542 may correspond to other functions 
depending on the context or architecture of the system with 
which user interface 500 is being utilized. These other 
functions may include command functions or menu functions. 

[0079] FIGURE 6 depicts with more particularity the set of zones 52 0 
528 which are associated with the English language. Zones 
520-528 may be laid out in a curve, such that zones 520-528 
may be actuated with the thumb of the hand holding the device 
on which user interface 500 is utilized. In the embodiment 
depicted, zones 520-528 are arranged so they may be more 
efficiently actuated by a right-handed person. However, as 
discussed above, the landscape of the keys may be altered to 
any desired format, including formats which allow efficient 
actuation by the left hand. Each zone 520-528 contains up to 
six pairs of letters, grouped by key into upper pair 610, 
center pair 620 and lower pair 630. A user may actuate zones 
520-528, principle sensor 210 will detect this actuation, pas 
messages corresponding to this actuation to keystroke logic 
22 0, which will in turn translate these message to an Input 
message. Semantization logic 240 will then associate this 
Input message with the proper pair of letters. 

[0080] For example, to select center pair 620 a user may touch the 
zone with the desired pair and release. To select any lower 
pair 630 a user may touch the zone with the desired pair and 
slide to any zone in the direction of the pair desired, the 
converse is true for selecting upper pair 610. To illustrate 
this example further, suppose a letter of the pair (s,h) is 
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desired. This selection may be accomplished by touching the 
zone 526 containing (g,m) and sliding the movement until the 
pair {b,j). Note that the movement may stop in any zone 520- 
524 below zone 52 6 containing the desired pair (s,h) . 
Semantization logic 240 may still resolve this movement into 
the desired pair (s,h) . 

[0081] Semantization logic 240 may receive a message from keystroke 
module 220 or additional sensors 212, and translate this 
message to a set of symbols or a set of commands depending on 
the context in which the message is received. In one 
particular embodiment, semantization logic 240 receives an 
Input message from keystroke logic 220 and translates this to 
a set of possible of symbols in accordance with a context. 
For example, a user actuation as described above with regards 
to FIGURE 6 may be resolved to the letter pair (s,h) . This 
resolution may then be passed to core logic 250. 
Semantization logic 240 may also be responsible for 
coordinating the input and output of the system in accordance 
with the context of the system. 

[0082] In more specific embodiments, semantization module 240 may be 
responsible for translation of input messages to symbols or 
commands. This translation may be context dependent; the part 
of the context needed by semantization module may be called 
SemantifyContext . Semantization module 240 may respond to an 
input message representing symbols according to the context 
(SemantifyContext) , with a SymbolMessage containing a set of 
symbols . 

[0083] In the case where an input message represents a command, a 



I 
I 
I 
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message (CommandMessage) is emitted containing the unique 
identifier of the command. This unique identifier information 
can take a value which corresponds to the command desired, 
including: BackSpaceCmd, EndLexiconCmd, ItemSelectionCmd, 
NextCharCmd and PrevCharCmd. If the input message represents 
nothing with respect to the context (SemantifyContext) , no 
action may be taken. 

[0084] An input message can represent both a command and a set of 

symbols, in such a case a SymbolMessage may be sent out before 
the CommandMessage in order to display the set of symbols. 
Semantization module 240 may also be responsible for 
configuration of sensors 210, 212. 



[0085] In some embodiments, messages from text element 280, menu 
element 282 or user context 2 90 are handled as follows: 



[0086] A Selectltem triggers the emission of a CommandMessage with 
ItemSelectionCmd as command identifier and the argument of 
Selectltem. 

[0087] If the last EndEdit message is more recent than the last 

StartEdit, the reception of a StartEdit triggers an EditText 
message. If the last StartEdit message is more recent than 
the last EndEdit, the reception of an EndEdit triggers the 
emission of a CommandMessage to the core logic 250 with 
argument EndLexiconCmd. 

[0088] If the semantization module 240 is about to send a 

SymbolMessage when the last EndEdit message is more recent 
than the last StartEdit, semantization module 240 suspends the 
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sending of the SymbolMessage, sends a message Text Input Query 
to user context 290. If the next message received is EditText 
associated with a QueryFlag, semantization module 240 forwards 
the EditText message to core logic 250, and sends a suspended 
SymbolMessage . 

[0089] A SemantifyContext may be the incremental result of a 

Context Generation function called with the last received 

message (either from architecture components or external) and 
the previous SemantifyContext as arguments. Between any two 
messages the SemantifyContext remains consistent. 

[0090] Turning now to core logic 250 employed by semantization module 
240, FIGURE 7 depicts the basic methodology employed by core 
logic 250. Core logic 250 may take a set of inputs (block 
710) , apply processing rules (block 720) and return output 
(block 730) if necessary. Core logic 250 may take a set of 
inputs (block 710) , these inputs may be generated internally 
by core logic 250, or may be from a user through principle 
sensor 210. Inputs may also be received by core logic 250 
(block 710) from semantization module 240. As described above, 
in certain circumstances, semantization module 240 may 
generate a set of messages, depending on messages it has 
received and the context in which it is operating. 
Semantization module 240 may pass these messages to core logic 
250, which may in turn take these messages as a set of inputs 
(block 710) . In some embodiments, this input may be messages 
representing symbols or commands. Additionally, these messages 
may contain a prefix and a sequence of symbols related to the 
prefix. 
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[0091] Core logic 250 may take this set of inputs (block 710) and 

process them (block 720) according to a specific set of rules. 
This processing may include the formulation of a proposition, 
which may be a symbol, word, arbitrary sequence of characters, 
or any combination. This proposition may be associated with, 
or derived from, the disambiguation of an input, a predictive 
completion or prediction, or some combination of the two. 

[0092] The rules which core logic 250 uses to process the set of 

inputs (block 720) may be designed to utilize language model 
260 and usage model 2 80 to produce a set of candidates or 
predictions, and evaluate these candidates or predictions to 
generate a set of propositions, one or more of which may be 
presented to a user. In one embodiment, core logic 250 uses a 
scoring system to process the input (block 720) . When core 
logic receives an input, if the input has no prefix, one or 
more propositions may be generated and evaluated according to 
a scoring system. However, if a prefix is present a sequence 
may be generated by appending the new input to the prefix and 
attempting to generate and evaluate one or more candidates 
based on this sequence. If no candidates can be generated, or 
the candidates generated do not surpass some threshold score a 
method may be provided to specify an input, otherwise the 
candidates are evaluated, predictions made, and a proposition 
may be formed and presented to a user based on this 
evaluation. Propositions may also be formulated in light of 
the disambiguation of the original input. 

[0093] In certain embodiments, core logic 250 may formulate a 

candidate based on the input, determine a prediction from one 
or more of the candidates and formulate a proposition to 
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present to the user from the set of candidates and predictions 
generated. 

[0094] In some embodiments, rules and algorithms utilized by core 
logic 250 may be grouped into three categories, guessing 
rules, unambiguous rules, and internal rules. Unambiguous and 
guessing rules may be the main driver for core logic 250, 
while the internal rules may serve as the control 
functionality behind core logic 2 50, and may provide 
functionality responsible for the use of the guessing and 
unambiguous rule sets, the evaluation of these two rule sets, 
or responses from these rule sets. Additionally, the internal 
rules may be responsible for creating a prefix or proposition 
to be shown to a user. 

[0095] These internal rules may worlc in tandem with the unambiguous 
rules and guessing rules depending on the functionality 
desired and the context in which the rules are being utilized. 
These rules may describe the behavior of the system in various 
contexts, and may be employed based on user desire or internal 
status. The unambiguous rules may describe behavior when the 
input received either from the user, or from semantization 
module 240, is ambiguous or there is no adequate candidate. 
In certain embodiments, these unambiguous rules employ only 
language model 260. Guessing rules may describe behavior when 
a relevant candidate can be found, and in embodiments may 
employ both language model 260 and usage model 280. 

[0096] In one particular embodiment, core logic module 250 receives a 
CommandMessage or SymbolMessage from semantization module 240. 
Core logic 250 responds to those messages in accordance with a 
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set of rules. When the core logic 250 rules indicate 
communication with a user, the semant izat ion module 24 0 is 
used for delivery of the corresponding messages used for 
interface with the user, 

[0097] In this embodiment, the rest of the core logic 250 rules are 
organized under three headings: 1) Guessing rules, which 
describe the core logic 250 behavior when the system has 
relevant propositions about input; 2) Unambiguous rules which 
describe how the system behaves when an input is ambiguous and 
the system can guess no relevant proposition about the input; 
and Internal rules, which contain several functions necessary 
to the evaluation of the guessing in the unambiguous rules. 

[0098] Each of those rules is further detailed below. 
The variables common to all the rules are: 

CurrentRun a sequence of symbols. 

CurrentPropositionSet a set of Propositions which may be 
stored in a data structure, which may be represented in code 
as : 

typedef struct { 

Char* run; 

Int count : } Proposition; 

InputSeqs is set sequence of set symbols. 
EditionState can take two values: GuessingState and 
Unambiguous St ate . 

FinalSeq is a sequence of symbols or Undefined. 
CurrentSym is an integer. 
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[0099] When the core logic 250 receives an EditText message, it 
responds by: 

Sending a StartEdit message to the text element. 
If EditText prefix is empty then 

CurrentPropositionSet is set to an empty set; 

InputSeqs is set to an empty set; 

The rule ShowPropositions is evaluated; 

Else 

NewPrefix is evaluated with argument the EditText 
prefix; 

if CurrentPropositionSet is empty 

EditionState is set to UnambiguousState; 
InitUnambiguous is evaluated; 

Else 

rule ShowPropositions is evaluated; 

[0100] This algorithm initializes the state of core logic 250. for new 
input. The initiation is simple if there is no prefix. When 
a prefix is provided, the algorithm is designed to attempt to 
lookup the longest partial string being a possible completion 
of the prefix, utilizing language model 260. If language 
model 260 does not contain such a partial string, the 
algorithm switches to an unambiguous input mode. 

[0101] i. Guessing Rules 

The guessing rules apply when EditionState=GuessingState . This 
section of rules has a unique constant Add. There is a rule 
for each type of message arriving to core logic 250: 

Syinbo message has as list of symbols S= (Si,..., St) with 
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set InputSeqs to the concatenation of InputSeqs 
and (S) 

if CurrentRun is not empty, evaluate 

newPrefix( Current Run) and set CurrentRun to empty 

if CurrentPropositionSet is empty 

EditionState is set to UnambiguousState; 

InitUnambiguous is evaluated; 

Else 

If CurrentPropositionSet is empty then: 
L is the maximal set of non identical 
elements of type Node {Li,„.,Ln} where 
for each i part of {l,..,n}: 

Li.c=Sj with j part of {l,..,t} 
IsInSubtree (Li . location, root) 
P is the minimal set of proposition (Pi/..wPm) 
such as each i part of {l,..,m}: 

Pi.run= (Li.c) and Pi . count =Li . count 
PF=FilterProposition (P) 
If PF is empty then: 

set the EditionState to 
UnambiguousState 
evaluate InitUnambiguous 

else : 

set CurrentPropositionSet to PF 
evaluate ShowPropositions 
emit a UserNotify message with click 
as argument . 

else : 

The currentProposiontSet is the set {CPi, CPm} . 
TempLocationSet is the maximal set of not 
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identical Node elements 

{ (Li,Xi, Yi,Counti) (Ln,Xn, Yn,Countn) } where for 
each i part of {l,..,n}: 

Li. c=Sj where j part of {l,...,t} 
Yi is CPj with j in {l,m} 
Xi is the longest sequence of Node 
(Ni,...,Nin) where : 

(Ni.c,..,,NTn-c) forms a suffix not null 
of Yi.run 

IsInSubtree (Ni, root) 
IsInSubtree (Li,Nni) 

Counti=ExtendProposition (Li, CPj . count) 
PF is the sequence of propositions (PFPi, PFPn) 
where for each i in {l,...,n}: 

PFPi.run is equal to the concatenation of 

Yi.run and Li.c 
PF2=FilterProposition (PF) 
if PF2 is empty then: 

set the EditionState to UnambiguousState 

evaluate InitUnambiguous 

else 

Current PropositionSet is to the set PF2 

where for each i in {l,...,n}: 

e va lua t e ShowPr opos i t i ons 

emit a UserNotify message with click as 

argument . 



Note: These rules for SymbolMessage may process user input, 
and try to find out which symbol sequence the user wants to 
input based on all the information received since the last 
EditText message, regardless of whether the user input is 
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ambiguous or not . 

CommandMe s s age 

Depending on the command identifier contained in the 
message, the following actions are taken: 
BackSpaceCmd . 

If InputSeqs is an empty sequence 

an EditStop message is sent to the UseContext 

else 

Replay is the sequence 
(EditTextO , SymbolMessage (Si) 

SymbolMessage (Sn-2) ) along the input secjuence 
(Si, . . , Sn) . CurrentPropositionSet and InputSeqs 
are set to empty sets. The CoreLogic sends only 
to itself one after the other the elements of 
Replay as an argument of SymbolMessage and 
prevents himself to emit messages to other 
modules. Finally a SymbolMessage (Sn-i) is sent. 

Note: These rules may make core logic 250 return to the state 
it was in before the last undiscarded SymbolMessage occurred. 

EndLexiconCmd 

If CurrentRun is equal to Add 

Set the EditionState to UnambiguousState . 
Eva luate Init Unamb i g uou s 

Else 

if CurrentRun is empty and 

CurrentPropositionSet is of the form (P,...) set 
CurrentRun to P. run 

Send a SubstituteText with argument CurrentRun 
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An EditEnd message is send to the user context 
and to the Semantization module , 
Evaluate the StoreWord rule with as argument 
Current Run 

Note: If the user had selected a suggestion (Add) in the menu 
this rules switches core logic 250 rules to unambiguous mode 
else this rule cleans up the environment and takes account of 
the symbol sequence composed since the last StartEdit by the 
user. 

I t emS e 1 ec t ionCmd 

If the highlighted item in menu is the command Add 
then: 

Set Current Run to Add 

Send a SubstituteText with argument P.c , while 
CurrentPropositionSet is (P ...) 

Else : 

Set the CurrentRun to the argument of 
PropositionSelect 

Send a SubstituteText message with the 
CurrentRun to the text field. 
Evaluate ShowRules 

Note: The user may select one of the propositions 
as the input he wants to appear in the 
TextElement . 

102] Unambiguous Rules 

The rules indexed under Guessing rules apply when 
EditionState=UnambiguousState . 
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In i tUnamb i guou s 

Emit UserNotification (Error) 
set CurrentSym to 1 

InputSeqs has the form (Si,...,Sn), set FinalSeq to 
a sequence of n elements 

Undefined 
evaluate ShowUnambiguous 

send StartAmbiguousInput to the Semantify module 

Symbo iMe s s age 

has as sequence S as argument 

The InputSeqs has the form {Si,...,Sn)i set 
InputSeqs to the sequence (Si, Sn^ S) 
The FinalSeq has the form (Fl , Fn) , set FinalSeq 
to the sequence (Fl Fn, Undefined) 
evaluate ShowUnambiguous 
CommandMe s s age 

BackSpaceCmd 

If InputSeqs is of the form (S) : 

Send SubstituteText with as argument an empty 
sequence . 

Send an EditEnd message to the user context and 
to the Semantify module . 

Else: 

InputSeqs is of the form (Si,...,Sn), set 
InputSeqs to (Si, .-.,Sn-i) 

FinalSeq is of the form (Fi,...,Fn) , set FinalSeq 

to (Fi, ...,Fn-i) 

EndLexiconCmd 
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send a SubstituteText message with the 
CurrentRun to the text field 

An EditEnd message is send to the user context 

and to the Semantify module. 

evaluate the StoreWord rule with as argument 

CurrentRun 

I t emSel ec t lonCmd 

has as argument A. 

If A is equal to Done 

send a SubstituteText message with the 
CurrentRun to the text field 
An EditEnd message is send to the user 
context and to the Semantify module, 
evaluate the StoreWord rule with as 
argument CurrentRun 

Else : 

FinalSeq of the form (Fi, Fcurrentsym- 

1 / FcurrentSym # FcurrentSym+1 / / Fn) i S Set t O ( Fi , 
— # FcurrentSym-l / A, FcurrentSym+1 / — / Fn) 

set Current Sym to Current Sym+1 
Eva 1 ua t e S ho wUnamb i guou s 

PrevCharCmd 

if CurrentSym=l emit UserNotify (Error) else 
Current Sym=CurrenSym- 1 

NextCharCmd 

if CurrentSym note equal to the length of 
InputSeq emit UserNotify (Error) else 
Current Sym=Cur renSym+ 1 
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[0103] iii 



Internal Rules 



The rules described in this section are explicitly called when 
evaluating ''Unambiguous Rules" and ''Guessing Rules" 



ShowUnambiguous 

InputSeq is of the form 

( (Si,i,-,Si,M(i)) /-"/ (Sn,i,...,Sn,M(n)) ) where M(x) is the 
function that gives the length of x*^^ element of 
InputSeq. FinalSeq is of the form (Fi,...,Fn) where each 
Fi is equal to Undefined or one of the elements of 
{Si,i,.-., Si,M(i) ) . C is one of the sequences (Ci,...,Cn) 
where for i in {l,...,n}: 

If Fi ^ Undefined then Ci=Fi else Ci in 

(Si,i, Si,M(l) ) 

Quotationn maximal such: 

If Ci=L.c with IsInSubtree (L, root ) 

Quotationi=L- count else Quotationi=0 

I in {2,...,n} Quotationn=Accumulate (Quotationi- 

i,L. count) where L=NodeFromPath (Ci_j Ci) with J 

maximal 

set CurrentRun to c 

Send MenuSet message with argument 

( ScurrentSym, 1 / — / ScurrenSym, M (1) , / Done ) 

Accumulate and Invariant may be defined in an 
implementation dependent manner, which will be 
apparent to those of skill in the are after reading 
this disclosure - 

NewPrefix 
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Has as argument s = (Ci...Cr) 

Set InputSeqs to the sequence ( {Ci} , {Cr} ) 
P is the proposition such as: 

p.run= s, p.count= Invariant 
The longest sequence of (Ni,..., Nm) where: 
IsInSubtree (Ni,Ni+i) with i in [l,m-l] 
Nni=root 

For each Ni with i in [l,m-l] , Ni.c=Cr-m+i 

If P exists 

Set CurrenPropositionSet to FilterProposition ( {p} ) 
Else 

Set CurrentPropositionSet to empty 
ShowPropositions rule 

CurrentPropositionSet is of the form {Pi,...,Pn} 
UP is the set where for i in {l,...,n}: 
the sequence PredA 

( (PrAi,CAi,SymAi ),..., (PrAm,CAm,SymAm) ) is equal to 
Get Sugest ions (Pi. run) 

the sequence PredB the minimal sequence 

( (PrBi, CBi, SymBi) (PrBk, CBk, SymBk) ) with: 
for each j in {l,...,k-l} CBjaCBj+i 
for each j in {l,...,m} if Relevant (PrAj ) 
Adjust ( (PrAj , CAj , SymAj ) is part of PredB 

U is equal to GetUNodeFromPath (Pi . run) 

if U is equal to UnknownPath then 
((Pi. run, Pi.c)) is part of UP 

else 

(Adjust ( (Pi. run, Pi.c,U) ) , (PrBi, CBi, SymBi) , 
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(PrBk/CBk, SymBi) ) is part of UP 
if the menu is capable of showing a tree send SetMenu 
message with argument FilterSortExtentions (U) , else: 

U is of the form (( (DS , DC) ( (PS , PC) ( (SS , SC) ...)...) ...) 

U2 is the minimal list made of all the elements 

(DS,DC) , (PS, PC) and {SS,SC) from U sorted along 

the DC,PC,SC 

send SetMenu message with argument U2 

Note: These rules may show the user all the relevant 
propositions currently evaluated for the 
disambiguisation/prediction for his input. 

Adjust is implementation dependent and may relate the 
probabilities of predictions with the probabilities of 
the disambiguation propositions. This assures the user 
input is on average minimal . 

ExtendProposition and FilterProposition are functions 
which may be implementation dependent. Filter 
Proposition may adjusts the threshold scoring level 
which determines the suggestion which are shown to the 
user must be. A higher threshold lowers the number of 
irrelevancies in the suggestions shown to the user. 

StoreWord is also implementation dependent, and may be 
a holding place for a set of characters and may manage 
the amount of resources allocated to the LanguageModel 
and UsageModel structure. It may also control the 
models adaptation to the user's behavior to improver 
prediction/disambiguation for future use. 
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[0104] Moving on to FIGURE 8, one embodiment of a method for which 
core logic 250 may utilize language model 260 is depicted. 
Language model 260 may be an extension and improvement of the 
model depicted in an article by J.G. Cleary and J.H. Wit ten 
titled "Data Compression Using Adaptive Coding and Partial 
String Matching" published in Volume Com-32, Number 4 of the 
April 1984 issue of the IEEE Communications journal [Clearyl] , 
incorporated fully herein by reference. Core logic 250 may 
employ language model 260 to receive a prefix or an input 
(block 810) , formulate a set of candidates (block 820) and 
rank the set of candidates (block 830) . Core logic 250 may 
receive a prefix and an input. A prefix may be an arbitrarily 
long set of symbols including an empty set, while an input may 
be an Input message of the type described with regard to 
semantization module 240. A set of candidates is then 
calculated (block 820) based on the prefix and the input. If 
the prefix is empty a candidate may be calculated, based 
solely on the input. If, however, the prefix is not an empty 
set, core logic 250 may attempt to lookup a set of string 
associated with the prefix and generate candidates (block 820) 
based on the prefix, 

[0105] In some embodiments, core logic 250 may formulate these 

candidates (block 82 0) using language model 2 60, which may 
employ a tree structure to store arbitrarily long partial 
strings, and associated scores. In one embodiment, core logic 
250 finds the longest string in language model 260 which is 
associated with the prefix. In associated embodiments, core 
logic 250 may use language model 260 to generate a list of 
candidates, this list may be a flat list or a nested list. 
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depending on the implementation. Core logic may determine the 
score to associate with a candidate based on the score 
associated with each node of the tree implemented in language 
model 260. These scores may in turn be based on a 
probabilistic model. 

[0106] It will be readily apparent to those of ordinary skill in the 
art that a wide variety of algorithms that may be used to 
calculate scores, and score candidates. For example, suppose 
the user wishes to enter the word ''test" . Using the layout 
described in FIGURE 6, a user may select the pair (t,z), core 
logic 250 may then calculate the probability of ''t" and the 
probability of ''z" . Next, the user will select the pair 
{e,f) . Core logic 250 may use language model to calculate the 
score of the letters ''e" and ''f" following each of the letters 
associated with the set of previous candidates (t, z) , and 
multiply it times the probability of each letter in the 
initial letter pair (t,z). In this manner, the probability of 
each sequence of two letters may be calculated. 

[0107] In one particular emlpodiment, the structure for storing 

language model 260 of this type, and associated logic for 
processing input and calculating probabilities may be 
implemented as follows. 

[0108] Language model 260 may be contained in structures of type 
LanguageModel represented as: 

Typdef struct { 

Char c ; 

Int counter; 
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Node S t rue t * next ; 

NodeStruct* subtree; }NodeStruct 

Typedef NodeStruct* Node; 

Node root; 

The Node structure may, in embodiments, be the structure of 
[Clearyl] , modulo the syntax. Root may be core logic's 
entry point to language model 260. 

109] The following properties and algorithms may be defined over 
the structure which supports language model 260: 

IsInSubtree (Node N2, Node Ni) 

If Ni. subtree is an invalid reference then 
IsInSubtree (N2,Ni) is false 

else 

If N2=Ni . subtree then 

IsInSubtree (N2,Ni) is true 

Else 

IsInSubtree (N2,Ni) =IsNeighbor (N2/N1. subtr 

ee) 

Note: This property is true when N2 is a son of Ni and 
false in other cases 

IsNeighbor (Node N2, Node Ni) 

If Nl.next is an invalid reference to a node then 
IsNeighbor (N2,Ni) is false 

Else 
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If N2=Ni.next then 



IsNeighbor (N2/N1) is true 



Else 



IsNeighbor (N2,Ni) =IsNeighbor (N2,Ni.next) 



[0110] 



[0111] 



Note: Tliis property is true wlien N2 is on the same 
level as Ni in the tree and false in other cases 

NodeFromPath ( (Si, . . , Sn) ) 

For each i in {l,..,n} Si is a symbol 
N is the sequence of nodes (Ni,..,Nn) where: 
for each i in {l,..,n} Ni.c=Si 

for each i in {l,...,n-l} IsInSubtree (Ni+i,Ni) is 
true 

IsInSubtree (Ni, root) 
if N exists NodeFromPath ( (Si, Sn) ) sets values to 
Nn else it sets values to UnknownPath 



FIGURE 9 depicts one embodiment of a method for which core 
logic 250 may utilize usage model 280. Core logic 250 may 
employ usage model 280 to receive a set of string (bloclc 910) , 
formulate a set of predictions (bloclc 920) and ranJc the set of 
predictions (block 930) . Core logic 250 may receive a set of 
symbols (block 910) which may consist of a set of strings, 
these strings may be a prefix and an input from a user. In 
some embodiments, these symbols may be derived or extracted by 
core logic 250 from a set of candidates assembled by, or 
returned to, core logic using language model 260. 

This set of strings may be then be used to formulate a set of 
predictions (block 820) based on a string from the set of 
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strings. Usage model 280 may determine a series of possible 
completions to the string based on statistical ordering of the 
possible completions . 

[0112] In one embodiment, usage model 280 may use a tree structure to 
represent words, with each node of a tree containing a 
character. Each node which may designate the end of word may 
be demarcated, and a score associated with such a terminal 
node. It will be readily apparent to those of ordinary skill 
in the art the variety of scoring systems which may be used to 
assign a score. 

[0113] To continue with the above example, suppose the string ^^te" is 
received by usage model 280. FIGURE 10 depicts one 
representation of a tree which usage model 280 may use to 
complete the string. The string ''te" may be used as index into 
a tree based representation of words 1000 which begin with 
^^te" , this tree 1000 may have terminal nodes 1010, 1020, 1030, 
1040 which may contain indicia that these nodes offer possible 
completions to the string ^'te" . This indicia may be a score 
associated terminal nodes 1010, 1020, 1030, 1040, or on some 
embodiments a boolean value. This score may be based on the 
number of nodes of the same depth which may complete the same 
string. The score may also be based on how many nodes in the 
same branch are not a completion for the string. 

[0114] Based on the score associated with a node the node may be 

ranked (block 830) and the sequence of characters associated 
with each node in the path through tree 1000 to the node with 
the highest (or lowest) ranking may be returned. Referring to 
FIGURE 10, if node 1020 has the highest score the sequence of 
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characters "tested" may be returned. In other embodiments, 
each sequence of characters associated with terminal nodes in 
paths through the subtree may be returned along with the score 
of the associated terminal node. 

[0115] Similarly, synonyms of the predictions, may be returned along 
with their associated scores. For example, in FIGURE 10 if 
node 1020 has the highest score the sequence of characters 
"tested" may be returned. Additionally one or more synonyms 
may be returned. These synonyms may be determined based on 
their associated scores, and may be returned with or without 
these associated scores. The synonyms associated with a 
particular prediction may be stored in the terminal node 
corresponding to a prediction. To continue the example, 
terminal node 1020 corresponding to the prediction "tested" 
may store a variety of synonyms of "tested" including 
"tester", "testes", "tesla" etc. These synonyms of "tested" 
may be returned along with the sequence of symbols 
corresponding to the prediction "tested" . In one particular 
embodiment, the structure for storing usage model 280 of this 
type, and associated logic for processing input and 
calculating scores or probabilities may be implemented as 
follows . 

Typedef enum{true, false} Boolean; 

Typedef struct { 

Char* s; 
Int count ; 

} SymListNode 
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Typedef struct { 



Char 



c; 



Unode [] 



sons ; 



Boolean 



terminal ; 



Int 



heads ; 



Int 



count ; 



SymListNode [] sym; 
} Unode 

uNode* Uroot; 

Uroot may be the entry point for core logic 250 into 
usage model 280. The following properties over usage 
model 280 may be defined: 



IsSon (Ui : UPNode , U2 : UPNode ) 

Is true if Ui part if U2.sons else 
IsSon(Ui,U2) is false 

Note: IsSon denotes if Ui is a direct son of U2 

GetUNodeFromPath ( (Ci , Cn) ) 

The sequence (UPi,...,UPn) : 

IsSon (UPi,URoot) is true 

for each i in {l,..,n-l} IsSon (UPi+i, UPi) 
is true 

for each i in {l,...,n} UPi.c=Ci 
If UPn exists GetUNodeFromPath { (Ci, Cn) ) 
values to UPn else values to UnknownPath 

Note: GetUpFromPath returns the UNode in the 
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UsageModel associated with the sequence given as 
argument if it exists, else the constant 
UnknownPath is return. 



GetSuggestions (C) 

C is a sequence of symbols 
UP is the result of GetUPFromPath (C) 
UPS is the minimal set of UPNodes 
(UPSi, UPSn) where for each i in {l,,..,n}: 
there is a sequence (Ti,...,Tq) where: 
Ti is equal to UP 
Tq is equal to UPSi q>i 
for each j in {l,...,q-l} 
• IsSon{Tj+i,Tj) 
UPSi . terminal is true 
S is the sequence 

( (Si, Count i,Symi ),..., (Sn/ Count n,Symi) where for 
each i in {l,...,n}: UPSx=GetUPFromPath (Si) , 
UPSx- count =Countj and Symj =UPSx . sym 
GetSugestion(C) values to S 

Note: GetSuggestions retrieves words and their 
synonyms of a given prefix. Each word and synonym 
is associated with a score denoting their 
frequency of use in usage model 280. 



116] Returning to FIGURE 7, core logic may then formulate an output 

(block 730) based upon the processing of the input (block 720) 

discussed above. This output may be a sequence of characters 
or a sequence of characters with a set of menu options. This 

output (block 730) may then be presented to a user for further 
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interaction. In one embodiment, core logic 250 may interact 
with semantization module 240 in order to present the 
formulated output (block 730) as a proposition to a user. 

[0117] Note that not all of the tables, fields, or steps described 
are necessary, that tables, fields or steps may not be 
required, and that further tables, fields or steps may be 
added in addition to those illustrated. Additionally, the 
order in which each of the activities is listed is not 
necessarily the order in which they are performed. After 
reading this specification, a person of ordinary skill in the 
art will be capable of determining which tables, fields and 
orderings best suit any particular objective. 

[0118] In the foregoing specification, the invention has been 

described with reference to specific embodiments. However, 
one of ordinary skill in the art appreciates that various 
modifications and changes can be made without departing from 
the scope of the invention as set forth in the claims below. 
Accordingly, the specification and figures are to be regarded 
in an illustrative rather than a restrictive sense, and all 
such modifications are intended to be included within the 
scope of invention. 

[0119] Benefits, other advantages, and solutions to problems have 
been described above with regard to specific embodiments. 
However, the benefits, advantages, solutions to problems, and 
any component (s) that may cause any benefit, advantage, or 
solution to occur or become more pronounced are not to be 
construed as a critical, required, or essential feature or 
component of any or all the claims. 
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